Fuzzy Clustering for Finding Fuzzy Partitions of Many-Valued Attribute Domains in a Concept Analysis Perspective
نویسندگان
چکیده
Although an overall knowledge discovery process consists of a distinct pre-processing stage followed by the data mining step, it seems that existing formal concept analysis (FCA) and association rules mining (ARM) approaches, dealing with many-valued contexts, mainly focus on the data mining stage. An “intelligent” pre-processing of input contexts is often absent in existing FCA/ARM approaches, leading to an unavoidable information loss. Usually, many-valued attribute domains need to be first fuzzily partitioned. However, it is unrealistic that the most appropriate fuzzy partitions can be provided by domain experts. In this paper, an unsupervised learning stage, based on Fuzzy C-Means algorithm, is proposed in order to get fuzzy partitions that are faithful to data for quantitative attribute domains, and consequently for avoiding the loss of valuable association rules due to the use of empirical fuzzy partitions. More precisely, the paper reports an experiment where it is shown that some rules are no longer found because their support or confidence is too low when using such empirical partitions. Experimental results show that the learned fuzzy partition outperforms human expert fuzzy partitions. More generally, the paper provide discussions about the handling of many-valued attributes in both fuzzy FCA and fuzzy ARM. Keywords— Many-valued formal contexts, fuzzy partitions, fuzzy C-means, association rules.
منابع مشابه
A Framework for Optimal Attribute Evaluation and Selection in Hesitant Fuzzy Environment Based on Enhanced Ordered Weighted Entropy Approach for Medical Dataset
Background: In this paper, a generic hesitant fuzzy set (HFS) model for clustering various ECG beats according to weights of attributes is proposed. A comprehensive review of the electrocardiogram signal classification and segmentation methodologies indicates that algorithms which are able to effectively handle the nonstationary and uncertainty of the signals should be used for ECG analysis. Ex...
متن کاملDetermining Fuzzy Sets for Quantitative Attributes in Data Mining Problems
The problem of mining association rules for fuzzy quantitative items was introduced and an algorithm proposed in [5]. However, the algorithm assumes that fuzzy sets are given. In this paper we propose a method to find the fuzzy sets for each quantitative attribute in a database by using clustering techniques. We present a scheme for finding the optimal partitioning of a data set during the clus...
متن کاملA Fuzzy C-means Algorithm for Clustering Fuzzy Data and Its Application in Clustering Incomplete Data
The fuzzy c-means clustering algorithm is a useful tool for clustering; but it is convenient only for crisp complete data. In this article, an enhancement of the algorithm is proposed which is suitable for clustering trapezoidal fuzzy data. A linear ranking function is used to define a distance for trapezoidal fuzzy data. Then, as an application, a method based on the proposed algorithm is pres...
متن کاملAssessment of distance-based multi-attribute group decision-making methods from a maintenance strategy perspective
Maintenance has been acknowledged by industrial management as a significant influencing factor of plant performance. Effective plant maintenance can be realized by developing a proper maintenance strategy. However, selecting an appropriate maintenance strategy is difficult because maintenance is a non-repetitive task such as production activity. Maintenance also does not leave a consistent trac...
متن کاملArithmetic Aggregation Operators for Interval-valued Intuitionistic Linguistic Variables and Application to Multi-attribute Group Decision Making
The intuitionistic linguistic set (ILS) is an extension of linguisitc variable. To overcome the drawback of using single real number to represent membership degree and non-membership degree for ILS, the concept of interval-valued intuitionistic linguistic set (IVILS) is introduced through representing the membership degree and non-membership degree with intervals for ILS in this paper. The oper...
متن کامل